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17 ABSTRACT 

is Guinea fowl coronavirus (GfCoV) causes fulminating enteritis that can result in 

19 a daily death rate of 20% in guinea fowl flocks. Here we studied GfCoV diversity 

20 and evaluated its phenotypic consequences. Over the period 2014-2016, 

21 affected guinea fowl flocks were sampled in France and avian coronavirus 

22 presence was confirmed by PCR on intestinal content and 

23 immunohistochemistry of intestinal tissue. Sequencing revealed 89% amino 

24 acid identity between the viral attachment protein SI of GfCoV/2014 and the 

25 previously identified GfCoV/2011. To study the receptor interactions as a 

26 determinant for tropism and pathogenicity, recombinant SI proteins were 

27 produced and analyzed by glycan and tissue arrays. Glycan array analysis 

28 revealed that viral attachment SI proteins from GfCoV/2014 and GfCoV/2011 

29 can, in addition to the previously elucidated biantennary diLacNAc receptor, 

30 bind to glycans capped with alpha 2,6-linked sialic acids. Interestingly, 

31 recombinant GfCoV/2014-S1 has an increased affinity for these glycans 

32 compared to GfCoV/2011 -SI, which was in agreement with the increased 

33 avidity of GfCoV/2014-S1 for gastrointestinal tract tissues. Enzymatic removal 

34 of receptors from tissues before applying spike proteins confirmed the 

35 specificity of SI tissue binding. Overall, we demonstrate that diversity in 

36 GfCoV SI proteins results in differences in glycan and tissue binding 

37 properties. 

38 

39 IMPORTANCE Avian coronaviruses cause major global problems in the poultry 

40 industry. As causative agents of huge economical losses, the detection and 

41 understanding of the molecular determinants of viral tropism is of ultimate 
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42 importance. Here we set out to study those parameters and obtained in-depth insight 

43 in the virus-host interactions of guinea fowl coronavirus (GfCoV). Our data indicate 

44 that diversity in GfCoV viral attachment proteins result in differences in affinity for 

45 glycan receptors, as well as altered avidity for intestinal tract tissues, which might 

46 have consequences for its tissue tropism and pathogenesis in guinea fowls. 

47 

48 INTRODUCTION 

49 Avian coronaviruses (AvCoV) pose a major threat to poultry health, production and 

50 welfare worldwide. AvCoVs are highly infectious, remain endemic in poultry 

51 populations and, due to their high mutation rate, frequently produce new antigenic 

52 variants (1, 2). The best-known AvCoV is infectious bronchitis virus (IBV), causing 

53 mainly respiratory disease in chickens. In addition, IBV-like viruses have been 

54 detected in other domestic poultry, including turkey and quail (3-5). In guinea fowl, 

55 coronaviruses have been identified for the first time in 2011 as the causative agent 

56 for fulminating enteritis(6). Full genome sequencing revealed that guinea fowl 

57 coronavirus GfCoV/FFt/2011 is closely associated with turkey coronavirus (TCoV) (7), 

58 both causing gastrointestinal tract infections in their respective host (6, 8). Clinical 

59 signs related to GfCoV infection in guinea fowl include prostration, ruffled feathers, 

60 decreased water and feed consumption, and have resulted in a daily death rate up to 

61 20% in several farms in France (6). Upon necropsy, whitish and enlarged pancreases 

62 were consistently reported. Histopathological analyses revealed pancreatic necrosis 

63 and lesions of various intensities in the intestinal epithelium, with most severe lesions 

64 found in the duodenum of affected animals (6). 

65 
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66 Genetic classification of AvCoVs is based on phylogenetic analysis of the SI domain 

67 of its viral attachment protein spike (2). The spike protein is the main determinant for 

68 tropism (9), and the N-terminal part of the SI of IBV has been shown to contain the 

69 receptor-binding domain (RBD) (10). Studies using recombinant IBV-S1 and/or -RBD 

70 proteins have demonstrated that the viral tropism is reflected by tissue binding of 

71 such proteins (11). Mutations in the spike proteins of IBV might either result in 

72 decreased (10) or increased (12) avidity for its receptor present on epithelial cells of 

73 the chicken trachea. In contrast to IBV, GfCoV and TCoV target the epithelial cells of 

74 the gastrointestinal tract (4, 6), and recombinant protein binding of their SI proteins 

75 reflects this viral tropism, with predominant staining of epithelial cells of the small 

76 intestine (4). Glycan array analysis identified elongated LacNAcs on branched N- 

77 glycans as the host receptor for enteric AvCoVs, which are abundantly expressed on 

78 intestinal tissues (4). 

79 

80 Clinical symptoms in guinea fowl similar to those reported in 2011 are continuously 

81 reported by veterinarians in France (personal communication). However, studies on 

82 newly emerging GfCoVs are particularly hampered by the lack of models to grow the 

83 virus. More specifically, susceptible cell lines have not yet been identified, inoculation 

84 of embryonated guinea fowl eggs did not result in GfCoV production (data not 

85 shown), and SPF guinea fowls are not available for experimental infection. 

86 

87 Here we set out to study the consequences of GfCoV genetic diversity for glycan and 

88 tissue interactions. We revealed that the GfCoV spike gene from the 2014-2016 

89 outbreak in guinea fowl flocks in France was 89% identical to that of GfCoV/2011 (7). 

90 Glycan and tissue binding analyses of GfCoV/2011 and GfCoV/2014 recombinant 
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91 spike SI revealed that, while both proteins had the same specificity, GfCoV/2014-S1 

92 had a much higher affinity toward glycan receptors and tissues of the lower 

93 gastrointestinal tract, in agreement with the observed replication of the virus in these 

94 tissues from field cases. Taken together we demonstrate GfCoV diversity results in 

95 phenotypically different receptor binding properties. 

96 

97 RESULTS 

98 Lesions and coronaviral protein expression in the gastrointestinal tract of 

99 diseased guinea fowls between 2014-2016. 

100 Fulminating disease (peracute enteritis) in guinea flocks continued to be reported 

101 after the initial outbreak of GfCoV infection in 2011 (6). Between February 2014 and 

102 November 2016, duodena from 29 diseased guinea fowls were collected and 

103 analyzed for lesions and coronaviral protein expression. FHistological analysis of 

104 tissues by H&E staining revealed lesions in all duodena, with clear infiltration of 

105 inflammatory cells in remnants of the villi (Fig. 1, black arrowheads). For seven 

106 animals the entire gastrointestinal tract was available for histological analysis, 

107 showing lesions across the entire length of the intestinal tract, including the colon 

108 (Fig 1, black arrowheads). Viral protein expression using antibodies against the M 

109 protein of avian coronaviruses was observed in all duodena and four out of the seven 

no lower intestinal tracts by immunohistochemistry (Fig. 1, white arrowheads). In the 

in colons devoid of expression of viral proteins, the infiltration of inflammatory cells was 

112 noted, suggestive of a previous exposure to a virologic agent. 

113 

114 In contrast to what we observed, virus replication of GfCoV/2011 appeared to be 

115 restricted to the duodenum (6). Unfortunately, we were unable to confirm the lack of 
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116 infection of lower gastrointestinal tract samples in the previous outbreak due to 

117 unavailability of samples. Nevertheless, we here hypothesize that genetically 

118 divergent GfCoVs might have caused phenotypic differences in guinea fowls over the 

119 years. 

120 

121 Circulation of genetically diverse GfCoV. 

122 Gastrointestinal content collected from twenty affected animals between February 

123 2014 and November 2016 were analyzed for the presence of gammacoronavirus 

124 genetic material by one-step real-time RT-PCR using pan-gammacoronavirus 

125 primers (13). For all samples, Ct values obtained were below 35 (data not shown), 

126 confirming the presence of coronaviral RNA in all tested samples (Table 1). Next, 

127 overlapping conventional PCRs were performed with primers based on the spike 

128 gene of the GfCoV/2011 virus (sequences available upon request). Partial SI 

129 sequences could be obtained from ten out of twenty RT-PCR positive samples (893- 

130 1841 nt/ 3624nt for complete S, Tablel), the quality and/or quantity of the remaining 

131 ten samples was too low to generate PCR products. Sanger sequencing of the 

132 obtained fragments confirmed the presence of GfCoV in the intestinal content of all 

133 ten birds, confirming continuous GfCoV circulation in France. 

134 

135 Phylogenetic analysis was performed to investigate the genetic diversity of the 

136 obtained partial SI sequences using Maximum likelihood analyses (Fig. 2). The 

137 results showed that the 2014/2016 sequences clearly clustered with the SI reference 

138 gene from GfCoV/2011 (NCBI HF544506) supported by a bootstrap value of 100, 

139 while they were genetically more distantly related to TCoV. Each of the GfCoV- 
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140 2014/2016 partial SI sequences shared 84-90% nt identity with GfCoV/2011 and 

141 between the 2014-2016 partial SI sequences the variation was 0.1 to 8.0%. 

142 

143 Only from one sample a full spike sequence could be obtained (yCoV/AvCoV/guinea 

144 fowl/France/14032/2014; NCBI MG765535), while for the others the amount and/or 

145 quality of the viral RNA samples were too low for further analyses. Comparison of the 

146 SI gene of GfCoV/2014 with that of GfCoV/2011 using the Kimura 2-parameter 

147 distance model indicated that the genes had an 85% nucleotide and 89% amino acid 

148 sequence identity. Alignment of the amino acid sequences did not indicate clear 

149 mutation hotspots (data not shown) and the huge sequence diversity with IBV-M41- 

150 SI (the only avian coronavirus for which a cryo-EM structure has been elucidated 

151 (14)) impairs further suggestions on the implications of each of the mutations. 

152 

153 GfCoV/2014-S1 recognizes the enteric coronavirus diLacNAc glycan receptor 

154 with higher affinity than GfCoV/2011 -SI. 

155 Using the glycan array of the Consortium for Functional Glycomics, we previously 

156 determined that SI from GfCoV/2011 specifically binds to the diLacNAc glycan 

157 receptors (Gal_1,4GlcNAc_1,3Gal_1,4GlcNAc) (4). To study whether the observed 

158 changes in the spike of GfCoV/2014 resulted in differences in recognition of this 

159 glycan receptor, we recombinantly produced GfCoV/2014-SI and GfCoV/2011-SI 

160 and applied both proteins to diLacNAc-PAA conjugates in an ELISA as previously 

161 described (4). At similar protein concentrations GfCoV/2014-SI showed improved 

162 binding to this receptor (Fig. 3), indicating that the mutations in SI did not affect the 

163 specificity, but resulted in significant higher affinity, for this particular receptor. 

164 
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165 The genetic differences between GfCoV/2014 and /2011 did not alter glycan 

166 specificity. 

167 Next, we investigated whether the mutations in SI resulted in recognition of 

168 additional N-linked glycans. To this end, both SI proteins were applied to a novel 

169 glycan array containing N-glycan structures with their linear counterparts, either with 

170 terminal galactose or two differently linked sialic acid moieties (F. Broszeit and R.P. 

171 de Vries, submitted for publication). Schematic representations of each of the 

172 glycans are given in Fig. 4A. The data revealed that both GfCoV-SI proteins bind to 

173 longer biantennary LacNAc structures (Fig. 4B, structures #3-4), including the 

174 diLacNAc structure used in the ELISA (Fig. 3). Furthermore, both GfCoV-SI proteins 

175 bound to longer linear LacNAc repeats (Fig. 4B, structure #1), which were not 

176 included in the previous array (4). Finally, both GfCoV-SI proteins bound longer 

177 linear and biantennary LacNAc repeats with terminal alpha 2,6 sialic acid (Fig. 4B, 

178 structures #9-12), but not those capped with alpha 2,3 linked sialic acids (Fig. 4B, 

179 structures #5-8). Erythrina cristagalli lectin (ECA), Sambucus nigra lectin (SNA) and 

180 Maackia Amurensis Lectin I (MALI) were taken along as controls. We observed as 

181 expected specific binding to galactose, alpha 2,6 linked and alpha 2,3 linked sialic 

182 acid terminal glycans, respectively (Fig. 4C). In conclusion, both GfCoV-SI proteins 

183 show specificity for the same glycans, ending with either galactose or alpha 2,6 

184 linked sialic acids on the glycan array. However, the relative fluorescence observed 

185 for GfCoV/2014-S1 was consistently higher when compared to GfCoV/2011-SI, 

186 which is suggestive for differences in affinity for glycan receptors, as was observed 

187 for diLacNAcs in Fig. 3. 

188 
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189 GfCoV/2014-S1 has a higher affinity for glycan receptors compared to /2011-SI. 

190 To allow comparison of the binding affinities of both proteins for each glycan, we 

191 applied fivefold serial SI protein dilutions onto the glycan array and compared 

192 binding intensities at various scan powers. At each concentration, for all glycans 

193 shown in Figure 4A, binding signals of GfCoV/2014-S1 (Fig. 5A) were consistently 

194 higher than those of GfCoV/2011-SI (Fig. 5B). Detection of linear glycan binding 

195 (glycan #1 and #9) required higher concentrations and scan powers compared to the 

196 detection of biantennary LacNAc structures (glycans #3-4 and #11-12) for both 

197 proteins. Interestingly, binding intensity of GfCoV/2011-SI to glycans with terminal 

198 alpha 2,6 sialic acids was less compared to binding to glycans with terminal 

199 galactose (Fig. 5B compare structures #3-4 to #11-12 in 100 pg/mL to 20 pg/mL). 

200 This difference in preference for galactose-terminal glycans was not observed for 

201 GfCoV/2014-SI, since binding to glycan structures #3-4 and #11-12 was similar in 

202 each dilution applied to the array (Fig. 5A). Taken together, the data indicate that 

203 GfCoV/2014-SI has a higher affinity for all glycans bound on the array compared to 

204 /2011-SI. 

205 

206 GfCoV/2014-SI has a broader gastrointestinal tract tropism. To reveal whether 

207 the observed differences in glycan binding properties of the SI proteins have 

208 biological consequences for tissue tropism, we first determined whether the identified 

209 glycans are indeed present on gastrointestinal tract tissues of healthy, uninfected 

210 guinea fowl. Both SNA and ECA lectins stained the epithelial lining of the duodenum, 

2 11 jejunum and caecum intensely, while intermediate staining of the proventriculus and 

212 colon was observed. In the pancreas only limited binding of SNA was observed, with 

213 no staining by ECA; in contrast, in the ileum ECA strongly bound whereas SNA 
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214 bound only to a limited extend. In conclusion, all tissues of the gastrointestinal tract, 

215 except cloaca, express GfCoV glycan receptors (Table 2) (15). 

216 

217 Next, we investigated the binding patterns of GfCoV-SI proteins to gastrointestinal 

218 tissues. Both proteins stained the epithelial cells of almost the entire gastrointestinal 

219 tract (duodenum and colon in Fig. 6 1 st column; summary of results in Table 2), 

220 indicating that receptors present on the tissues allow binding of SI. Interestingly, 

221 staining intensities of the lower intestinal tract (ileum, caecum, colon) were much 

222 more apparent for GfCoV/2014-S1 than for GfCoV/2011-SI. This prompted us to 

223 analyze avidity and specificity to glycan receptors in the guinea fowl gastrointestinal 

224 tissues by GfCoV-SI proteins. We therefore pre-treated tissue slides with 

225 Arthrobacter ureafaciens neuraminidase (AUNA) and/or galactosidase to cleave off 

226 terminal sialic acids and galactose residues from host glycans, respectively. 

227 Treatment of the tissues with AUNA had only a minor effect on the binding of both 

228 GfCoV-SI, with a slight decrease in binding intensity to the duodenum for 

229 GfCoV/2014-S1 (Fig. 6A 2 nd column; Table 2). SNA lectin binding was completely 

230 abolished after pre-treatment with AUNA, confirming that the treatment did effectively 

231 cleave off all sialic acids from the host glycans (Table 2). 

232 

233 When galactose residues were removed from the tissues by treatment with 

234 galactosidase prior to applying ECA, binding was severely reduced or totally absent 

235 (Table 2). Binding of GfCoV/2011-SI to the tissue was completely abolished (Fig. 6 

236 3 rd column; Table 2), indicating that GfCoV tissue engagement is almost exclusively 

237 dependent on the presence of galactose-terminating glycans. On the other hand, 

238 GfCoV/2014-SI still clearly bound to the epithelial cells of the intestinal tract, 
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239 indicating a significant difference in receptor binding avidity (Fig. 6 3 rd column; Table 

240 2). 

241 

242 Finally, tissues were simultaneously pre-treated with AUNA and galactosidase to 

243 remove both galactose and sialic acids from the glycans of the host. Indeed binding 

244 of both ECA and SNA were strongly reduced (Table 2). Tissue binding of 

245 GfCoV/2011-SI was completely prevented, while GfCoV/2014-S1 still clearly bound 

246 to the epithelial cells of the gastrointestinal tract (except pancreas) (Fig. 6 4 th column; 

247 Table 2). These results suggest that either a minor amount of receptors is still 

248 present, or that yet an additional (glycan) receptor is involved in tissue binding of 

249 GfCoV/2014-SI. 

250 

251 DISCUSSION 

252 In this study we demonstrated ongoing GfCoV circulation in guinea fowl flocks in 

253 France. The sequence diversity between the viral attachment proteins of GfCoV 

254 circulating in 2011 and 2014 resulted in differences in receptor binding properties 

255 with profound phenotypic consequences. This relationship between these findings 

256 and in vivo pathogenesis can, however, only be elucidated in detail when new 

257 models to study this virus have been developed. 

258 

259 An amino acid sequence identity of 89% between viruses circulating only several 

260 years apart might indicate suggest that either a novel GfCoV strain was introduced in 

261 France from a yet unidentified source, or that there was high evolutionary pressure 

262 on the 2011 GfCoV strain. High mutation rates for avian coronaviruses are not 

263 uncommon (based on full genome sequences around 1.2x10" 3 substitution/site/year 
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264 (16, 17)). When comparing GfCoV/2011 and /2014-SI sequences, the calculated 

265 mutation rate was 5x1 O' 2 substitution/site/year with a dN/dS ratio of 0.45. Similar 

266 mutation rates of the spike have been reported for IBV (18) and are believed to be 

267 driven by selective pressure after vaccination (19, 20). However, no vaccine is 

268 available against GfCoV, nor against the closely related turkey coronavirus, TCoV. 

269 Another driver for genetic diversity is the population size (21), however, this is 

270 unlikely to explain the observed fast mutation rate of GfCoV since flocks are 

271 considerably smaller compared to chicken flocks. It might well be that circulating 

272 antibodies against field strains of GfCoV are main drivers of the observed sequence 

273 diversity. Unfortunately, retrospective studies to further elucidate the contribution of 

274 virus evolution, the circulation of other virus populations in the last years, or 

275 introduction of novel strains via for example trade of birds between farms, are 

276 impossible due to the lack of archive material. 

277 

278 Here we revealed a novel glycan receptor for GfCoV, the first coronavirus that binds 

279 N-glycans capped with alpha 2,6 linked sialic acids. Alpha 2,6 sialic acid presence 

280 has been reported previously in guinea fowl large intestine (15), as well as the 

281 previously elucidated poly-LacNAc expressed in guinea fowl small intestine (4). 

282 Together, their expression patterns can explain in large part the tropism of GfCoV, 

283 but it does not exclude, together with the results presented in this manuscript, that 

284 yet another host factor plays a role in GfCoV/2014 infection. Initial attempts to show 

285 whether protein receptors, required for infection of many other coronaviruses (22-24), 

286 are required were yet unsuccessful (data not shown). 

287 
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288 While spike protein binding analyses suggest phenotypic differences between these 

289 viruses in vivo, the reported gross clinical signs in field cases between 2011 and 

290 2014 were not markedly different. Attempts to study the pathogenesis of GfCoV/2014 

291 by inoculating commercial guinea fowls with GfCoV-containing fecal samples did, 

292 unfortunately and in contrast to a previous study (6), not result in manifestations of 

293 clinical signs or convincing detection of viral RNA by RT-QPCR (data not shown). 

294 Whether this was due to previous exposure of commercial birds to GfCoV and hence 

295 circulating antibodies preventing the infection remains to be investigated. 

296 

297 Here, we have demonstrated that GfCoV/2014-S1 has higher affinity for glycan 

298 receptors and increased avidity for the lower gastrointestinal tract compared to 

299 GfCoV/2011-SI. The viral genetic diversity between these spikes and the 

300 implications for receptor recognition further add to our understanding of this virus for 

301 which models are basically lacking. 

302 

303 MATERIAL AND METHODS 

304 Collection of field samples. Samples were collected from guinea fowls showing 

305 enteritis and concomitant high mortality (>10%) in flocks in five regions in France 

306 (Bretagne, Pays de Loire, Nouvelle-Aquitaine, Occitanie, and Auvergne-Rhone- 

307 Alpes) from February 2014 through November 2016. Gastrointestinal content was 

308 collected and stored at -80°C for viral RNA isolation. Tissues (duodenum, pancreas, 

309 airsac, lung, ‘small intestine’, large intestine, kidney, cloaca, trachea and bursa) were 

310 collected during necropsy, fixed for 24h in 4% buffered formaldehyde (m/v) and 

311 stored in 70% ethanol. 

312 
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313 Immunohistochemistry. Paraffin-embedded tissues were sliced at 4pm and 

314 deparaffinized in xylene and rehydrated in an ethanol gradient from 100%-70%. 

315 Antigen retrieval was carried out in Tris-EDTA pH 9,0 (preheated) before applying 

316 1% H 2 O 2 in methanol. After washing twice in Normal antibody diluent (Immunologic) 

317 mAb mouse anti IBV M protein 25.1 (Phonics, Lelystad, The Netherlands), cross 

318 reacting with TCoV and GfCoV (5) was applied for 1 hour at room temperature (RT). 

319 Slides were washed in PBS-0,1%Tween and EnVision kit (cat. no. K4001; Dako) was 

320 used for anti-mouse secondary antibody staining according to the manufacturers 

321 protocol. Slides were washed three times in PBS and viral M-protein presence was 

322 visualized with AEC. The tissues were counterstained with hematoxylin and mounted 

323 with AquaMount (Merck). 

324 

325 Molecular characterization of GfCoV. The gastrointestinal content collected from 

326 affected guinea fowl was clarified by centrifugation (30 sec at ll.OOOxg), and RNA 

327 was extracted using a Qiagen Viral RNA extraction kit following the instructions of the 

328 manufacturer. A one-step real-time RT-PCR targeting the avian coronavirus N-gene 

329 was carried out to confirm the presence of coronavirus RNA as previously described 

330 (13). Subsequently, the isolated RNA was reverse transcribed using the Revertaid kit 

331 with random hexamers (Thermo Fisher, Waltham, MA), and overlapping conventional 

332 PCRs were performed to amplify the guinea fowl S-gene (primer sequences available 

333 upon request). Sanger sequencing of the resulting fragments was performed using 

334 PCR primers. Contigs were generated with BioEdit (version 7.0.8.0) (25) and 

335 submitted to NCBI. Muscle (26) was used for the alignment, and Mega (version 6.06) 

336 with bootstrap value of 1000 for the phylogeny (27). Selective pressure was 
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337 calculated as dN/dS, and the dN=dS hypothesis was tested using Pamilo-Bianchi-Li 

338 method (28) with a p<0.05 considered statistically significant. 

339 

340 Construction of the expression vector. The codon-optimized sequence for 

341 GfCoV/2014-S1 (yCoV/AvCoV/guinea fowl/France/14032/2014; NCBI MG765535), 

342 containing an upstream Nhe\ and downstream Pad restriction site, was obtained 

343 from GenScript and cloned into the pCD5 expression vector by restriction digestion 

344 (as previously described (29)). The SI sequence is in frame with a C-terminal GCN4 

345 trimerization motif and Strep-Tag. The expression vector encoding GfCoV/2011-SI 

346 was generated previously (29). 

347 

348 Production of recombinant proteins. Recombinant SI proteins were expressed by 

349 transfection of human embryonic kidney (HEK293T) cells with pCD5-expression 

350 vectors using polyethylenimine (PEI) at a 1:12 (w/w) ratio. Cell culture supernatants 

351 were harvested after six days. The recombinant proteins were purified using Strep- 

352 Tactin sepharose beads as previously described (29). 

353 

354 ELISA. GaM ,4GlcNAc_1,3Gal_1,4GlcNAc (Consortium for Functional Glycomics), 

355 was coated in a 96-well maxisorp plate (NUNC, Sigma-Aldrich) at 0.5 pg/well 

356 overnight at 4°C, followed by blocking with 3% BSA (Sigma) in PBS-0.1% Tween. SI 

357 proteins were pre-incubated with Strep-Tactin HRPO (1:200) for 30 minutes on ice. 

358 For each protein, 2-fold dilutions were made in triplicate in PBS, and applied onto the 

359 coated well, followed by incubation for 2 hours at room temperature. TMB (3,3’,5,5’- 

360 tetramethylbenzidine, Thermo Scientific) substrate was used to visualize binding, 

361 after which the reaction was terminated using 1M H2S04. Optical densities 
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362 (OD450nm) were measured in FLUOstar Omega (BMG Labtech), and MARS Data 

363 Analysis Software was used for data analysis. Statistical analysis was performed 

364 using a 2-way ANOVA. 

365 

366 Glycan array. Glycan structures were printed in six replicates on glass slides 

367 (NEXTERION® Slide H, Schott Inc.). Prelabeled SI-proteins with Alexa647-linked 

368 anti-Strep-tag mouse antibody and with Alexa647-linked anti-mouse IgG (4:2:1 molar 

369 ratio) were applied to the slides (concentrations in figure legends) and incubated for 

370 90 minutes, after which the slides were washed with PBS and deionized water, dried 

371 and imaged immediately. 

372 As controls different lectins were applied: Erythrina cristagalli agglutinin /ECA), which 

373 is specific for glycans with terminal galactose, N-acetylgalactosamine, or lactose and 

374 Sambuca nigra agglutinin (SNA) and Maackia Amurensis Lectin I (MALI) which are 

375 specific for alpha 2,6 linked and alpha 2,3 linked sialic acids attached to terminal 

376 galactose respectively. Of the six replicates, the highest and lowest value were 

377 removed, and of the remaining four the total signal and SD values were calculated 

378 and plotted in bar graphs or heatmaps. 

379 

380 Spike histochemistry. Spike histochemistry was performed as previously described 

381 (29). SI proteins pre-complexed with Streptactin-HRPO were applied onto 4 pm 

382 sections of formalin-fixed paraffin embedded healthy guinea fowl tissues and binding 

383 was visualized using 3-amino-9-ethyl-carbazole (AEC; Sigma-Aldrich). Proteins were 

384 applied onto slides at 5 pg/ml. Where indicated the tissues were treated per slide with 

385 40U (3-galactosidase (Gal; Megazyme, USA) or 2 mU Neuraminidase (Sialidase) 
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386 from Arthrobacter ureafaciens (AUNA, Sigma, Germany) in 10 mM potassium 

387 acetate, 2,5 mg/ml TritonXIOO, pH 4.2 at 40°C O/N before protein application. 

388 

389 Lectin histochemistry. Lectin histochemistry was performed as previously 

390 described (4). Biotinylated- Erythrina cristagalli lectin or Biotinylated-Sambucus nigra 

391 lectin (both Vector Laboratories) were diluted in PBS to a final concentration of 2 

392 pg/ml (ECA) or 6 pg/ml (SNA) and applied to healthy guinea fowl tissue sections for 

393 30 min. After washing with PBS the signal was visualized by an Avidin-Biotin 

394 complex (ABC kit; Vector Laboratories) and counterstained with hematoxylin. 

395 

396 Data availability. Contigs are available in GenBank under accession numbers 

397 MG765535 to MG765542 and MK290733 to MK290734. 

398 
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499 FIGURE LEGENDS 

500 Figure 1. (Immuno)histological analyses of guinea fowl intestinal tract. 

501 Representative images of duodenum and colon from a guinea fowl presented with 

502 peracute enteritis in 2014 after staining with H&E (left) or antibodies against the M 

503 protein of infectious bronchitis virus, known to cross react with GfCoV-M protein in 

504 immunohistochemistry (IHC, right). Black arrowheads indicate inflammatory cells and 

505 white arrowheads indicate viral protein expression. 
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506 

507 Figure 2. Molecular phylogenetic analysis by Maximum Likelihood method 

508 comparing GfCoV (partial) spike sequences. Phylogenetic tree based on the 

509 Kimura 2-parameter model, in which bootstrap values are shown next to the 

5 10 branches. The analysis involved 22 nucleotide sequences. All positions containing 

511 gaps and missing data were eliminated. There were a total of 893 nucleotide 

512 positions in the final dataset. Evolutionary analyses were conducted in MEGA6. * 

513 indicate partial SI sequences of GfCoV. 

514 

515 Figure 3. Binding of GfCoV-SI to the enteric coronavirus glycan receptor 

516 diLacNAc. Concentration-dependent binding of GfCoV-SI proteins to 

517 Gal_1,4GlcNAc_1,3Gal_1,4GlcNAc in ELISA. As negative control, IBV-M41-NTD 

518 was taken along (10); 1: significant difference between GfCoV-SI and IBV-M41, 2: 

519 significant difference between GfCoV/2014-S1 and GfCoV/2011-SI (p<0.001). 

520 

521 Figure 4. Glycan binding specificity of guinea fowl SI proteins. Schematic 

522 representation of selected glycan structures present on the glycan array; numbers 

523 correspond to those shown in the graphs (A). Number 1-4 represent glycans ending 

524 with galactose, number 5-8 glycans capped with alpha 2,3 linked sialic acids, number 

525 9-12, glycans capped with alpha 2,6 linked sialic acids. Glycan receptor specificity of 

526 GfCoV-SI proteins (B) and lectins ECA, MALI and SNA (C) in glycan array assay (F. 

527 Broszeit and R.P. de Vries, submitted for publication); RFU: relative fluorescent units; 

528 yellow circle: galactose, blue square: GIcNAc, green circle: mannose, pink diamond: 

529 NeuAc. 

530 
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531 Figure 5. Glycan binding affinity of guinea fowl SI proteins. Glycan binding of 

532 GfCoV/2014-S1 (A) and GfCoV/2011-SI (B) are shown as heatmaps with 5-fold 

533 dilutions (100 pg/nnL to 4 pg/rnL) of the proteins applied to glycan array slides that 

534 are scanned with different laser intensities. RFU: relative fluorescent units; glycan 

535 numbers correspond to schematic representations shown in Figure 4A. 

536 

537 Figure 6. Binding of GfCoV-SI proteins to guinea fowl duodenum and colon 

538 without and with enzymatic pretreatment of the tissues. Spike histochemistry 

539 was performed on uninfected, healthy duodenum (A) and colon (B) tissues without 

540 and with pre-treatment of enzymes (AUNA and/or galactosidase) before applying 

541 GfCoV/2014-SI and GfCoV/2011-SI. Binding of proteins was visualized by red 

542 staining. 

543 TABLE 1 Overview of selected guinea fowls and obtained GfCoV spike sequences. 

544 Animals with bold animal numbers were included for immunohistological examination 

545 as well. *ND = unknown 

546 

547 TABLE 2 Relative binding of viral proteins and lectins on guinea fowl intestinal 

548 tissues. 

549 White box indicates no visible staining, light grey box indicates light to mild staining 

550 and/or not all epithelial cells show staining, dark grey indicates intense staining, most 

551 of the epithelial cells are showing positive signal, na = not analyzed 
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TABLE 1 Overview of selected guinea fowls and obtained GfCoV spike sequences. 
Animals with bold animal numbers were included for immunohistological examination as 
well 


Animal 

number 

Date of 
sample 
collection 
(week / year) 

Age at 
sampling 
time 
(weeks) 

Accession 

number: 

(Spike sequence) 

Nt identity (%) 
with 

GfCoV/2011-SI 


2011 


LN610099.1 
nt: 1-3708 

100 

14-002 

6/2014 

10 



14-013 

15/2014 

8 



14-032 

22/2014 

7 

MG765535 
nt: 1-3669 

85 

14-036 

24/2014 

7 



14-037 

25/2014 

7 



14-039 

26/2014 

5.5 



14-040 

23/2014 

ND* 

MG765536 
nt: 1-1392 

88 

14-041 

23/2014 

ND* 

MG765537 
nt: 1-1771 

88 

14-042 

23/2014 

ND* 

MG765538 
nt: 1-1392 

88 

14-047 

33/2014 

3 

MG765539 
nt: 1-1378 

88 

14-053 

37/2014 

9 

MG765540 
nt: 1-1393 

88 

14-065 

44/2014 

12 



14-066 

45/2014 

4 

MG765541 
nt: 1-1384 

88 

15-006 

3/2015 

ND* 

MG765542 
nt: 1-980 

87 

15-116 

46/2015 

7 



15-118 

47/2015 

8 



16-086 

38/2016 

ND* 

MK290733 
nt: 1-2465 

85 

16-115 

45/2016 

4 



16-123 

47/2016 

ND* 

MK290734 
nt: 571-1895 

86 


*ND = unknown 
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TABLE 2 Relative binding of viral proteins and lectins on guinea fowl intestinal 
tissues. 


Treatment 


Tissue 


AUNA 

Galactosidase 


proventiculus 

duodenum 

pancreas 

jejunum 

ileum 

ceacum 

colon 

cloaca 



White box indicates no visible staining, light grey box indicates light to mild staining and/or not all epithelial cells 
show staining, dark grey indicates intense staining, most of the epithelial cells are showing positive signal, na = 
not analyzed 
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